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In models where there is a global chiral symmetry which spontaneously breaks to its vectorial 
subgroup, the introduction of an explicit symmetry breaking perturbation will define the true vac- 
uum of the theory. This true vacuum is found via the minimization of the expectation value of the 
perturbing hamiltonian between different vacua as prescribed by Dashen. The procedure of finding 
the correct vacuum of the theory may result in the spontaneous breaking of CP symmetry even if 
one initially demands CP invariance on the perturbation. In this work we study in detail models 
where the perturbation is provided by four-Fermi operators. We present exact treatment for models 
with two fermion flavors and study the three flavor case in depth numerically. We show that after 
Dashen procedure is applied the solutions for the true vacuum fall in three classes with different CP 
breaking patterns. Critical transitions is possible between these classes as one varies the parameters 
of the perturbation. We rigorously show that at these transitions a pseudo-Goldstone boson mass 
vanishes. We also advocate, and substantiate with numerical statistical analysis for various types of 
models, that if one imposes CP invariance on the perturbation before solving the vacuum alignment, 
the resulting vacuum structure will have a sizable probability for a light pseudo-Goldstone boson 
mass. That is a statistical variant of Peccei-Quinn mechanism can be speculated to operate. 

PACS numbers: 12.60.Nz 



I. INTRODUCTION 

Spontaneous breaking of continuous symmetries is an important part of science, occurring in a wide range of natural 
systems. When it happens the system admits continuously degenerate vacua. It is then important to pick a vacuum 
around which to define excited states of the system. For example, a collection of spins located on the sites of a given 
lattice will have minimum energy when all are parallel. But if this is all there is to it one can not prefer one direction 
to the other. However in the presence of a magnetic field, no matter how small, this direction is mandatorily the 
direction of the field defining the true vacuum of the theory. In this case, one can not ignore the presence of the 
perturbation and work with an arbitrary vacuum since this will lead to unstable excitations. The determination of 
this true vacuum state compatible with the perturbation is called vacuum alignment problem. In certain models, 
vacuum alignment may also spoil discrete symmetries that are ordinarily defined with respect to a vacuum, chosen 
to be a standard in the absence of a perturbation. Such a counter-intuitive result is shown to happen by Dashen 
0, [IJ for the combined symmetry of charge conjugation and parity (CP), within the context of mass perturbations of 
quarks. He showed that there are classes of mass perturbations that are initially naively CP conserving with respect 
to a standard vacuum yet after vacuum alignment spontaneously break CP symmetry. 

In this work we will be studying vacuum alignment and spontaneous CP violation in theories where a global flavor 
chiral symmetry is spontaneously broken. We confine the perturbation that will explicitly break this chiral symmetry 
to the particular form of four- Fermi operators. Section II will present the model and the details of the Dashen program 
applied to it as well as the classification of CP breaking patterns. In Section III we present a mapping that simplifies 
the problem and allows a coherent exposition. Section IV is reserved to the phase transitions between different class of 
solutions as one varies parameters of the perturbing hamiltonian. We explicitly show that transitions between different 
class of solutions are accompanied with a vanishing pseudo-Goldstone boson mass. In Section V an exact treatment of 
the problem for two fermion flavors is presented. We also present a Monte-Carlo study where we have examined the 
statistical behavior of the pseudo-Goldstone boson masses and the CP breaking patterns for various types of models. 
As a result of this statistical study we also argue that an almost vanishing pseudo-Goldtone boson mass, that is fairly 
small compared to the largest one, is respectably probable. We argue that this result is closely connected to the fact 
that one imposes ab initio CP invariance on the perturbation. This effect bears a resemblance to the Peccei-Quinn 
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mechanism [12], |13[ where the anomaly parameter 9, a measure of CP violation in strong interactions, is promoted to 
be a held and is given an extra global U(l) symmetry that spontaneously breaks for 6 = 0. This consequently yields 
a Goldstone boson: the axion [3, EE]- However, the Peccei-Quinn mechanism is exact and our analogy is valid only 
in the mean within a large collection of models. In the last section we discuss future directions to enlarge the present 
work. 



II. THE MODEL 

Consider a collection of massless Dirac fermions with i = 1, • • • ,N. We assume that these fermions interact 
via a flavor blind, vectorial strong interaction described by a hamiltonian Ho- As is well known Hq admits a chiral 
flavor symmetry Gc = SU(N)l x SU(N)r . Let us assume that the interaction becomes strong and confining at some 
scale At thereby triggering the spontaneous breaking of Gc down to its vectorial subgroup Gy = SU(N)v, similar to 
the mechanism in QCD. The ground state of the theory is degenerate under this vectorial subgroup. This continuous 
degeneracy manifests itself as the appearance of massless excitations in the spectrum of the theory, namely the 
Goldstone bosons. The condensate characterizing this spontaneous breaking can be described as 

(0|** £ ^|0) = -A T S* (1) 

Here At ~ 2irF 3 and F is the decay constant of the Goldstone bosons of the theory assuming a similarity with QCD 
[2(ij |. We also have At ~ AizF. The state |0) is called the standard vacuum and is invariant under Gy- Using this fact 
one can also show that the equation defining the condensate is invariant under Gy. 

However ([1]) is not invariant under the action of the full group Gc nor under the action of the co-set space Gc / Gy 
characterizing the Goldstone bosons. In this case we have 

(0|#^|0) = -A T WV (2) 

with ipL,R — Wl,r^ l,r and W = WjWr, where W can also be thought of as parameterizing the Goldstone boson 
fields in a non-linear sigma model approach. This is the active point of view. One can also see this as written with 
respect to another vacuum \ 

(n\¥ L & R \n) = -A T w ij (3) 

where |fl) = t/(VK)|0) and U(W) is a unitary operator acting on the collection of ground states and carrying one to 
another. This is the passive point of view. 

Now, what happens if one introduces a perturbing hamiltonian Hi on top of Ho? The general answer to this is 
that one has to apply the Dashen's procedure [l[ which in principle says that not all hamiltonians are compatible 
with the standard vacuum. That is, not all hamiltonians admit the standard vacuum as the state which minimizes 
the energy. We then have the Dashen's theorem which says that the compatible vacuum state is defined by the state 
which minimizes the following 

e(w) = (n(w)|JTi|n(W0) = {o\u*(w)h 1 u(w)\o) (4) 

Of course one can see this equation from both points of view. The active one which means that one rotates the 
hamiltonian so that it becomes compatible with the standard vacuum, or the passive one which means that one finds 
the vacuum among degenerate vacua (2lj which is compatible with the hamiltonian. In this work we will use the 
active point of view. In this point of view Dahsen's second theorem states that for all the hamiltonians compatible 
with the standard vacuum the vacuum energy is minimized at W = I. That is, defining the rotated hamiltonian 

H[ = UHW)H 1 U(W) (5) 
to be a compatible hamiltonian, one arrives at that 



E{W) = (0\U\W)H[U(W)\0) 



(6) 
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is minimized at W = I. We have U(I) — X, with X representing the identity operator. 
Let us now assume that the perturbation has the following form 



ff 1 =A tfW *iy*iH7M*k (7) 

where the summation convention is understood on repeated indices. We use this convention throughout this work. 

The emergence of a perturbation of the type ([7]) can for instance be motivated in technicolor theories (describing 
in our language Hq) enriched with extended technicolor which is itself a gauge theory (see for instance [|[ for an 
introduction to technicolor and extended technicolor). Here to keep the analysis short we do not go into the details of 
motivating this any further, a new interaction that couples the left-handed currents to right-handed ones is sufficient. 
If this is the case the object Aj»jy will be proportional to the inverse mass matrix of the gauge bosons of the new 
interaction which presumably breaks at a higher energy scale than the scale at which Hq creates a condensate [22j . 
Of course in such a scenario one would just as well have perturbations that couple two left(right)-handed currents, 
but these will have vanishing expectation values in any of the degenerate chiral vacua (23j |. 

Without going into the details of the calculation the Dashen procedure applied to this problem means that one has 
to minimize the following quantity 



E(W) = -A TT A ijkl W^ H W jk (8) 

subject to the fact that W = W^Wr is a SU(N) matrix. In a QCD like theory Att ~ Aj,, but from now on we will 
discard the dimensionful constants and study with the appropriately normalized dimensionless objects. In this case 
the energy function simply becomes 



E(W) = -A ijkl W^ l W k (9) 

where now A's are dimensionless. Furthermore, since the perturbation explicitly breaks the chiral symmetry the 
Goldstone bosons will acquire masses. The mass matrix of these pseudo-Goldstone bosons (PGB), which is essentially 
the second derivative of the energy at the minimum, is given by the following 



Mi 



2A 



ijkl 



({\ a: x b }w^) w 3k + w^ u (w{\ a , x b }Y k - 2 (x a wT (wx b y k - 2 (X b W^) (WX a ) 



(10) 



Here A a 's represent the generators of SU(N) in the fundamental representation, and {a, b} = ab + ba is the anti- 
commutator. 

The minimization procedure applied to E(W) does not determine Wl and Wr separately and hence the rotated 
hamiltonian is not uniquely defined. However this degree is due to the fact that the standard vacuum in invariant 
under the vectorial subgroup. One can, as a convention, take W = Wl without loss of generality. In this case the 
rotated hamiltonian becomes 



A w ljkl = A 1 ' ] ' kl w^ il 'W 3 ' j 



(11) 



In terms of this rotated hamiltonian the PGB mass matrix becomes 



M 



2 ah 



2Af l [{X a , X b } H S jk + S H {X a , X b } jk - 2XlX h 0k - 2X h u X%] 



(12) 



Conditions on A^y: The hermiticity of the hamiltonian requires we impose the following 



ensuring that E(W) — E*(W). One can also impose time- reversal invariance on the hamiltonian, which is equivalent 
to imposing CP invariance in view of the CPT theorem. This means that we must have 



A' 3 "' e n 



(14) 
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that is they are real. This invariance mandates 

E(W)=E(W*) (15) 

In terms of the rotated hamiltonian this will read 

A«*f = (A« w )*=Ai« fc (16) 

Now, if W W* CP will be spontaneously broken. The philosophy behind this idea goes back to Dashen's work, 
where he showed that one can have complex numbers in a theory even if initially one imposes reality via T invariance. 
The complex numbers arise as a result of the fact that one has to rotate the hamiltonian to become compatible with 
the standard vacuum. One can also see this from the passive point of view. One defines CP invariance in the standard 
vacuum, yet the hamiltonian is compatible with another vacuum for which the original CP transformation may not 
necessarily ensure invariance. Dashen showed this effect to occur in a very simple model of quark mass perturbation. 
Even though the mass matrix he proposed was real it nevertheless had a non- vanishing phase for the determinant. 
That is, albeit there argdet(M q ) = it, the contribution to the chiral anomaly term was non-zero and consequently 
one has relevant complex phases (that is phases one can not rotate away via simple redefinitions of the fields) in the 
theory. This idea was promoted |4j to technicolor and extended technicolor to study CP violation via chiral vacuum 
alignment. The model was later studied in || and to somewhat larger extent in [y, 0, S]- In this work we aim at 
enlarging the understanding of this model. 

A. Rational Phase Solutions 

In [U it was discovered that the minimization of @ gives three classes of solutions 

• W = W* , here W is simply an orthogonal matrix, CP is not spontaneously broken. 

• W W* with arbitrary complexity, CP is spontaneously broken arbitrarily. 

• W j£ W* where all the phases of W are rational multiples of tt, CP is spontaneously broken with a pattern. 

In these solutions are called CP-conserving (CPC), CP-violating (CPV) and pseudo CP-conserving (PCP) re- 
spectively. We will follow this naming scheme. This classification is also valid for the rotated hamiltonian (jTTj) as 
was shown in 5]. That is, in a CPC phase the rotated hamiltonian is real, in a PCP phase it has phases that are 
rational powers of tt and in a CPV phase its phases are irrational multiples of n. There is an exception to this, if 
all of the complexity of W comes as an overall phase the rotated hamiltonian will always be a CPC one. An overall 
phase containing all of the complexity means that one has W — e %x O with O real and orthogonal and this will fix \ 
unambiguously depending on whether detO is 1 or — 1. 

At this point we would like to mention that since the energy function is quadratic [24| in W every solution can be 
multiplied with a matrix exp(2irik / N) for k = 0, 1, • • • , N — 1, to generate a new solution. In short every solution is 
degenerate under Zn, the center of SU(N). With this remark we understand that if the complexity of W is carried 
by an overall phase this phase must be and element of Zjq or s/Zjj. 

The possibility for CPC and PCP phases is understood via the phase locking mechanism as explained in detail in 
[H, 0]- This mechanism is present because the energy function is quadratic in W. To review the idea let us rewrite 
the energy as follows 

E = -A ijkl \W ti \\W ik \e i &"'-+ 1 *) (17) 

If for a particular ijkl, Aijki > the contribution of this term to the total energy will be minimized if the phases of 
W b and obey <f)jk — 4>u or if Ayjy < the contribution of this term to the total energy will be minimized if 
4>jk = 4>U + 1"- This is the phase locking mechanism. However since W is constrained to be an SU(N) matrix this 
cannot happen for all the elements, that is the phase locking may or may not be frustrated by the unitarity and 
unimodularity of W. But when it is allowed this is the mechanism that triggers the solution to be in the CPC or 
PCP phases. If the mechanism is not operational at all for a given model the solution will be a CPV one. 
Let us consider the following parameterization 
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W = D 1 W C kmD 2 (18) 

where D\ and D2 are diagonal SU(N) matrices and Wqkm is a Cabibbo-Kobayashi-Maskawa type matrix. This 
representation is called the Harari-Leurer representation Wckm generally has phases which also infest the 

norms of its elements. That is not all the elements of Wqkm have norms and phases that are independent of each 
other. This is an obstruction for phase locking. In fact in [5j it was shown that for phase locking mechanism to occur 
one must have 



W = D x OD 2 (19) 

where O is an SO(N) matrix. In this case the phases of all the elements of W are independent degrees than those 
constituting the norms of W, consequently phase locking can not be frustrated. 

SU(N) representations are in general complex. However there are cases where the complex conjugate of an element 
may be obtained by a conjugation with another element of the group. In this case one has 



W* = g- L Wg (20) 

and one says W is a pseudo-real element. We would like to recall that since the energy function we are dealing with is 
quadratic in W and thus that any solution to minimization is defined modulo Zn, the above equation must actually 
read 

W* = Z N g- x Wg (21) 

It is clear that for a generally complex unitary matrix this condition can not be fulfilled. However the special form of 
(dnj) allows for such a relation between W and W*. The condition that has to be satisfied is 

D{OD* 2 = Z N g- 1 D 1 OD 2 g (22) 

meaning 

= Z N D 1 g- 1 D 1 OD 2 gD 2 (23) 
If O is a general SO(N) matrix, for the condition above to hold one must have 



_1 = Z N Df (24a) 
g = Df (24b) 



and one ends up with the following 



(DxD 2 f = Z N (25) 



which in turn yields 



D X D 2 = Kexp(kTTi/N) (26) 

Here k — 0, 1, • • • , N — 1 and if is a diagonal real matrix with K 2 — 1, that is K is diagonal with ±1 as entries. Of 
course we must remember that D\ and D 2 are diagonal SU(N) matrices which imposes det(DiD 2 ) = 1. 

This analysis shows the emergence of rational phase solutions as was shown in 0, @, 0| > albeit via means of numerical 
analysis. To get insight into its meaning let us consider (|26p with k — for simplicity. If we take 



(DM 2 = I £i£ 2 = K 



(27) 
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and consider the following 

D x O a D 2 D x O h D 2 = D x O c D 2 (28) 
one can show that defining a new product rule 

O a *O b = O c (29) 

with O a -kOb = O a D 2 DiOb, all the axioms of group theory are satisfied. That is the form D\O a D 2 with the product 
defined as mentioned constitutes a group. Therefor various forms of D\D 2 satisfying the pseudo-reality condition (|2~lj) 
characterize different embcddings of O(N) elements into SU(N) in the form of DiO a D 2 . 

However these embeddings should not in general be called subgroups since they have a group structure with respect 
to a different product rule than that of the group they are part of, namely SU(N) with the product defined as the 
usual matrix multiplication. The only one among these embeddings that is a subgroup is the one for K = I, in which 
case one really has a O(N) subgroup. Furthermore embeddings with K I are not connected to the identity element 
of SU(N) since their identity with respect to the product rule ★ is the matrix K. Within the jargon of group theory 
these embeddings can be called orbits. In the present context we call them O(N) type orbits. 

Further insight and new conditions will arise if we study the complex conjugate of these orbits. That is if 

D 1 O a D 2 (30) 

with D 1 D 2 = K characterizes an orbit so does 

DlO a D* 2 (31) 
In general one does not expect these two orbits to coincide. If they do however one must have 

D{O a D* 2 = D 1 O a D 2 (32) 

meaning that 

O a = DfO a Df (33) 
in view of the reality of orthogonal matrices this translates to 

Dl = Ki (34a) 
D\ = K 2 (34b) 

Where K\ and K 2 are iT-type matrices mentioned before. Combining these with (|27|) we get the following 

D\=Dl = K' (35a) 
D X D 2 = K (35b) 

where K' is again a if- type matrix. From this it is clear that possible 7r/2 phases will enter the game regardless of 
what SU(N) we are dealing with. 

So far we have dealt with the full O(N) matrices in D\OD 2 . But what happens if one actually has an 0(N— 1) matrix 
as a solution? In this case the rational phase structure is further enriched. Let us go back to the pseudo-complexity 
condition 

O = Z N D l g- 1 D 1 OD 2 gD 2 (36) 

If O is an element of O(N-l) the diagonal matrices D\ and D 2 are defined modulo Zat_i since this will commute with 
O and in reality we have the following 
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O = Z N Z N - 1 D l g- 1 D 1 OD 2 gD 2 (37) 

bringing more possibilities into the space of rational solutions. 

Our analysis so far means that for SU(N), rational phase solutions yield a W that has phases of the following form 

n ^ ±n 1 - ±n2 N 7T _ 2 ±---±n N ^ 1 n (38) 

where are natural numbers. 

To summarize our results we restate that rational phase solutions fall into O(N) type orbits of SU(N) as described 
above and that the orbits have group structures. Only one of those orbits is an actual subgroup of SU(N) with respect 
to the product rule of SU(N), the others are just subsets with a group structure defined with respect to another 
product rule. The orbits are characterized with K which in the most general case is a diagonal matrix and obeys 
K 2 = I. Thus in general one has a discrete set of orbits characterizing the rational phase solutions. In terms of our 
classification CPC solutions are the O(N) subgroup, PCP solutions are in the orbits with K ^ I and CPV solutions 
are generic elements of SU(N). 

In short the main reason for the possibility of rational phase solutions is that the energy function we are minimizing 
does not care about the overall phase of W . This also gives rise to the phase locking mechanism as presented above. 

As a final remark we would like to add that if a A^-jy gives a rational phase solution for a given D\ and D 2 a small 
perturbation of the model is likely to remain in an embedding characterized with the same Di and D 2 (modulo the 
overall Zn ambiguity). That is under a small perturbation of the hamiltonian one is likely to remain in the same 
orbit. We will elaborate on this in section [TV] 

1. Physical implications of rational phase solutions: 

In the previous section we have exposed the mathematical existance of rational phase solutions. Their physical 
relevance will generally depend on the particulars of the model. First of all in the interpretation of the spinors, what 
are those spinors? Second in the physical nature of the strong interaction that, by growing strong at low energies, will 
spontaneously break the global chiral flavor symmetry. Finally on the meaning of the perturbation that will explicitly 
break the chiral symmetry. 

One interpretation that is known to the author is that of works [5|,[6|. In simplified terms, there the spinors 
were technifermions, the strong interaction is technicolor and the explicit symmetry breaking is achieved by extended 
technicolor. The technifermions were also coupling to the quarks via extended technicolor four- Fermi interactions and 
thus provide a mass term for quarks which will in essence contain all the information necessary for the low energy 
theory. The use of rational phases in that interpretation was to naturally explain the observed fact that in the quark 
sector strong CP violation is absent but weak CP violation is accomodated. Or to put it in honest terms, for the 
phenomenology of extended technicolor the CPV cases are bad since they are in conflict with no strong CP violation 
and the CPC phases are unable to introduce a weak CP violation (the phase in the CKM matrix of quarks). The 
rational phase solutions opens up an avenue to circumvent these phenomenological obstacles. 

There could be various other physical cases where the rational phase solutions may prove to be interesting. For 
instance it is tempting to speculate that they may find applications in solid state phenomena such as fractional 
quantum hall effect. As the purpose of this work is to lay out the general mathematical structure we will not pursue 
any of these motivations. 

III. THE ADJOINT MAP 

It is possible to decompose the object A^ kl into the following form 

A ijkl = A ab^ x kl ( 3Q ) 

Where A's represent the generators of SU(N) in the fundamental representation enriched by Ao = I/y/2N with I 
representing the identity matrix. So we have N 2 of them. The possibility for such a decomposition is clear since 
the number of independent elements of A ijfci if they are allowed to be complex is 2N A which matches the number of 
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independent entries in A afc if they are allowed to be complex. In fact one can easily show that the decomposition is 
invertible. That is, given a A l: > kl one computes A a & as 

A ab = 4A^' k ' l '\( l '4 k ' (40) 
which can be reinserted in (139p and using the well known identity 

x ij X kl = Igilgik (41) 

one can show that the decomposition is one-to-one. We remind the reader that in (|41[) A is included in the sum over 
the generators. 

One can also understand the validity of the mentioned decomposition by remembering that the hamiltonian consists 
of operators of the form 

*i7 /i *i*fl7 M *fl (42) 
which transforms as the (Nl* x Nl) x (N r * x N r ) representation of SU(N) L x SU(N) R . This product decomposes as 

Ad L x Ad R Ad L x 1 R ® 1 L x Ad R © 1 L x 1 R (43) 

where Ad denotes the adjoint representation of SU(N) which has dimension N 2 — 1. The first term in (|43|) represent 
the elements without Ao for the decomposition in (|39p . The second and third terms represent contributions from A a o 
and Aoa and the last one from Aoo respectively. 

In terms of this decomposition the vacuum energy becomes 

E = -A ah Tr [wUaWAb] (44) 

from which we realize that the terms involving Ao are irrelevant. The terms in AdL x 1 R and 1l x Ad R do not contribute 
to energy at all. The term 1l x 1 r contributes only a constant which will be immaterial for the minimization procedure 
. So we can from the outset ignore the effect of Ao and assume we only use the fundamental generators of SU(N) in 
(|2H]) where the decomposition is defined [26| . In view of the obvious connection we would like to call the decomposition 
mentioned above the adjoint map of the problem. 



A. The independent elements of A which are relevant 



The hermiticity condition on A^ kl means 



( A jilky = A tjkl ^ ^ g R ( 45 ) 

as can be easily checked from (|3"5|) . So a generic model will have (N 2 — l) 2 relevant parameters. 

When however time-reversal symmetry is assumed in the original hamiltonian, that is A lJ ' H £ R, the parameters 
are further constrained. The algebra of SU(N) has N 2 — 1 elements; the Cartan subalgebra consisting of real diagonal 
generators of cardinality N—l and two sets consisting of real symmetric and pure imaginary anti-symmetric generators, 
both of which having N(N — l)/2 elements. In this language the combined effect of the hermiticity and reality 
conditions on A 1 ^ 1 simply means that A ab are real and do not have elements that mix real symmetric and pure 
imaginary anti-symmetric spaces of the generators. This means that in a suitable ordering of the generators A a , A afc 
has the following form 



Aijfl 








A// 



9 



We will always assume such an ordering is made and use it in the remaining part of this work. The upper left 
corner represents the space of real symmetric generators and the lower right corner represents the pure imaginary 
anti-symmetric generators. We also assume that within the set of real symmetric generators, the generators of the 
Cartan subalgebra is placed first. 

Finally from these considerations, it is clear that after the reality and hermiticity conditions are imposed on A 1 ^ 1 , 
the number of relevant degrees of freedom is 



N(N — 1) 



1 2 



+ 



N(N - 1) 



iV 2 (iV 2 + l) 



N(N+l) + l 



instead of the naive N 2 (N 2 + l)/2 disregarding the parts of A*- 7 '*' which does not contribute to the vacuum energy. 

B. The Formulation of the Problem in the Adjoint Map 

Let us remember that in the adjoint map the energy function reads 



E = -A afc Tr [W f A a WAb] 



(46) 



This form of the energy function has a special meaning. The operation W^A a W induces an automorphism of the 
SU(N) algebra which does not change its structure. In fact one has 



W^X a W = S ba X b 

S ba = 2Tr [W^X a WX b ] 



(47a) 
(47b) 



This means that if W = cxp(iu; a X a ) one has S — exp(iQ a (w)T ) where T a is a generator in the adjoint representation 
of the algebra, that is {T a ) bc = if a bc- From these defining equations of S it is easy to show the following 



^aa' ^bb' ^cc' j a'b 1 c' 
S aa' Sbb' S cc > d a ' b> c' 
S a a' Sbb'$a'b' 



fabc 
dabc 
Sab 



(48a) 
(48b) 
(48c) 



similar relations will hold for all the invariant tensors of the algebra, in particular it is clear from the above that S 
must be an orthogonal matrix with unit determinant. 
Written in this form, the energy function reads 



E 



-■^A ab Sba 



--Tr[AS] 



(49) 



One can carry all the necessary formulas from the original setting to the adjoint map and in general they are much 
simplified. For example the rotated hamiltonian in the adjoint map can be computed from its definition in the original 
setting 



A% kl = A^' kl wl,W fj 



(50) 



which results in 



A m = A ab (w\ Xa wf xf = A ah Sb C x^x 

Thus in the adjoint map the rotated hamiltonian is represented as 



kl 



(51) 



A s = A = AS* 



(52) 
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The equivalence of the two points of view is evident since tt a defining S and uj a defining W are related. So one may 
look at Q a as defining uj a (Q) and use the standard Dashcn procedure in the adjoint map [271 ]. 
The extremization condition is simply, 



Tr AT a = 



(53) 



We would like to emphasize that since T a are antisymmetric this is a condition on the antisymmetric part of A. One 
can show that the anti-symmetric part of the perturbation in the adjoint map represent the part that breaks parity 
symmetry. So the extremization condition is a constraint on parity breaking [28| 
The pseudo-Goldstone boson mass matrix is given as 



Mt b = -Tr 



A(T a T b + T b T a ) 



(54) 



The antisymmetric part of A does not contribute to this (neither it does to the energy) so one can also write is as 



M z ab = Tr 



AT a T b 



from which we also get 



E 



N 



Tr [M 2 ] 



(55) 



(56) 



The usefulness of the adjoint map becomes clear if we perceive that a quadratic functional became a linear one and 
consequently the extremization condition as well as the PGB mass matrix take on a particularly simple form. 

C. Zn degeneracies and the adjoint map of W* 

It is important to understand how the degeneracies of the original problem is changed after the adjoint map. If we 
remember the defining relation for S 



S ba = 2Tr [W^X a WX b ] 

we see that the Zn degeneracies disappear. That is if W is mapped into S then so is Z^W. 
As for the degeneracy E(W) = E(W*), if W is mapped to S(W) then W* is mapped to 



(57) 



S(W*) ba = 2Tr [W T X a W*X b ] 
By using the properties of the generators A a one can show that if one has 



(58) 



then one has 



S(W) = 



t 

Srr 


Sri 


Sir 


Sn 



S(W*) 



Srr. 


—Sri 


—Sir 


Sn , 
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The presence of a nonzero Sri and Sir means that the related W is in general complex. If Sri = and Sir = 
the implication is that the rotated hamiltonian A l ffi l is not a CPV one. 

It is worth mentioning here that the effect of complex conjugation of W can be represented in the adjoint map as 
a similarity transformation in 0(N 2 — 1). That is, one has 

S(W*) = C- 1 S(W)C (59) 

with 



c = 


( 

I 


t) 




V o 





An original hamiltonian yielding a A such that Arj — and Air = (meaning that A yfci are real) will commute 
with this and so the fact that S(W*) and S(W) yielding the same energies will remain valid, as expected. This is 
again benefit of the use of the adjoint map, it transforms complex conjugation in the original framework to a simple 
form. But, of course, the matrix C lives in 0(N 2 — 1), outside the space where S resides. 

1. Correspondance to the Non-Linear Sigma Model Language 

The adjoint map also makes a better connection to non-linear sigma model realization of the four-Fermi interaction 
we are studying. To make our formalism apparent in that language all we have to do is to remember that W — W l Wr 
is really parameterizing the Goldstone boson fields around the standard vacuum. Returning (for this discussion) the 
dimensionful parameters to their proper places we have W = exp [iTT a X a /F] = £ and the potential for the sigma field 
becomes 

- A T A ob Tr [sUaSAb] (60) 

The vacuum alignment procedure is equivalent to find the proper E Q around which the pion tadpoles are vanishing 
and pions have positive semi-definite masses. Such a term will only be relevant if it explicitly breaks the chiral flavor 
symmetry. Let us consider an explicit example of a LR symmetric model. In this case one has Aijki = A4ijAiki/M 2 
with M G 1Z and M T = M. The potential reads 



A. 



^Tr [tfMVM] (61) 

and it is obvious that for a generic M. the proper vacuum is given as S = /. Expanding around this we see that the 
PGB mass matrix term is 



A 



2APF 2 



2 

7T 7T 



'Tr [X a MX h M] (62) 



We remind the reader that F is the decay constant of the goldstone bosons, At ~ 2ttF 3 is the value of the fermion 
condensate and M 2 is the scale of the four-Fermi interaction that explicitly breaks the chiral symmetry. 

So in essence the adjoint map is close to the non-linear sigma model approach and is less cluttered than the 
A^ki notation. All one has to do is to study the behaviour of (|6T))) and this is what we are doing in essence. The 
transformation we have presented in (|47[) (with the properties in (|48p understood as constraints) is simply rephrazing 
the non-linear sigma model in another language which provides technical ease. This is valid since physical observables 
of non-linear sigma model is reparameterization invariant. 
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IV. DIGRESSION ON TRANSITION POINTS 

Consider the following functional 

E(x) = ~Tr[A(x)S(x)} (63) 

where S(x) is the matrix that minimizes the energy for a given A(x). As we change x we will vary A(x), that is we 
follow a path in the space of hamiltonians. Thus the vacuum energy E{x) and the corresponding S{x) will change. 

Every now and then the excursion along x traverses regions in the solution space where the corresponding rotated 
hamiltonian is real, arbitrarily complex or admits only rational phase complexity. That is, in the language presented 
before one may change between CPC, CPV or PCP phases. One may traverse between any two regions. As we have 
discussed, the rational phase solutions CPC or PCP are related to discrete sets of orbits (different cmbcddings of O(N) 
into SU(N)), whereas a CPV phase lies in the bulk of the group space. Consequently, transitions between CPC and 
PCP (or between different PCP) phases are discrete changes in W and hence in S. That is, S changes discontinuously 
at the transition point. There is no such argument from the passages from CPC or PCP to the CPV phases, and 
one expects a continuous transition. That is, S is changing continuously as one traverses the critical x. In this last 
case one expects a discontinuity in the derivative of S because in CPV phases the solution is an arbitrary element of 
SU(N) but in CPC or PCP phases the solutions are in 0(N)-type orbits so one requires less number of generators to 
represent the group element and the number of generators representing a group element is related to its derivative. 

Another, albeit non-technical, way of seeing this structure is the following. CPV phases form the bulk of the 
solutions in the group space of SU(N) whereas CPC and PCP phases are different islands representing different 
embeddings of O(N) within the sea of CPV solutions. One may jump from island to island and this represents a 
discontinuous change in S. Whereas one may take a swim from an island or get ashore an island from the sea of CPV 
solutions. This second case represent a continuous change in S since the sea and the shore are close in group space. 
But there nevertheless is a discrete change, one is either swimming or standing on an island. This change represent 
itself in the first derivative of S [29] . 

The aim of this section is to show that at a continuous or discontinuous transition point at least one PGB mass 
vanishes. We will assume that A(x) is a smooth function. That is, all of its derivatives exist. We follow a constructive 
scheme to prove the summary proposition which is at the end of this section. 



Proposition 1: The quantity Tr A( 



, x \dS(x) 



dx 



vanishes for all 



This follows immediately from the fact that the extremization condition reads 

Tr [A(x)S(x)T a ] = (64) 

and the fact that the derivative of S — exp(if2 a (x)T a ) can always be represented in terms of the tangent space (the 
algebra), 

— — = S(x)T a G ab {il)— — (65) 
dx dx 

where G is a general invertible matrix. As a corollary we have the following 



dx 2 r 



Proposition 2: E(x) is a continuous function of x. 



d ^pS(x) 
dx 



(66) 



This follows from the smoothness of A(x), for if E{x) is assumed to have a discontinuity then dE/dx must have a 
Dirac delta singularity. But if A (a;) is smooth, dE/dx given by the corollary above is bounded from both above and 
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below since S(x) takes values from a compact space. This contradicts the assumption that E(x) is discontinuous. 

Proposition 3: The behavior of S(x) near a point x = is determined by the PGB mass matrix at x = 0, 5(0) 
and dA/dx at x = 0. . 

Let us consider a point near x = where S(x) is given to be smooth. We can certainly expand A(x) = Aq + xAi 
since it is smooth. If S(x) is smooth we can also expand it as S(x) = 5(0) (/ + xs a T a ). The extremization condition 
is 

Tr[(A + xA 1 )S(0)(I + xs a T a )T b }=0. 
and will give the following matching the like powers in an expansion up to first order in x, 



Tr [A 5(0)T fc ] = (67a) 

Tr [Ai5(0)T a ] = Tr [A S(0)T a T b ] s b (67b) 

The first expression merely reemphasizes the extremization condition for x = 0. The second equation can be rewritten 
in terms of the PGB matrix at x = 



Tr [AiS(0)T„] - M a 2 h (0)s h (68) 

This is an equation for s a and thus determines how 5 will change. It is clear that if the left hand side does not 
vanish (which is not expected for a generic Ai) there should be no zero eigenvalues in the PGB mass matrix, since 
one need to invert it to find s a . If however the left hand side vanishes, we either have s b = 0, that is the solution 
for minimization does not change, or s b ^ 0. For the latter case there must be at least one zero eigenvalue and thus 
s b must be an element of the null space of the PGB mass matrix. So the vanishing of Tr [Ai5(0)T(,] may trigger 
a degenerate minimum borrowing the jargon from Morse theory. It is worthwhile noting that this condition is the 
extremization equation for a rotated hamiltonian consisting of Ai5(0) alone. Of course if Ai is proportional to A 
the solution must be s a = 0. 

Since in this corollary we have assumed that S(x) is smooth we have to exercise a little more care in proving the 
existence of a vanishing PGB mass, the next two propositions deal with this. 

Proposition 4: If S(x) changes continuously at x — but its first derivative has a discontinuity, then at least one 
PGB mass must vanish at x = 0. 

If we only have the continuity of S(x) near x = we can expand it to the left and right as follows 



S{x > 0) = 5(0) (J + xs+T a ) (69a) 
S{x < 0) = S(0)(I + xs~T a ) (69b) 

Then the extremization conditions to the left and right neighborhoods of x = will give the following 



Tr[A 5(0)T 6 ] = (70a) 
Tr[A!5(0)T b ] = M 2 fc (0) S + (70b) 
Tr[A!5(0)T b ] = M a 2 h (0K (70c) 

Again the first equation is just the fact that x = is an extremum, which is given from the outset. The last two 
equations can be used to give the following 

M 2 ab (0)(s+ -s-)=Q (71) 

Since by assumption =^ there must be at least one zero eigenvalue in the PGB mass matrix for a solution. It 
is clear from these considerations that both s + and s~ must live in the null space for if otherwise they are to have 
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any other part not in the null space one has to invert the PGB mass matrix to find it [3C 
condition that has to be satisfied is 



This means that another 



Tr [Ai5(0)T 6 ] = 



(72) 



Finally we would like to remind that since the PGB mass matrix is given as Tr [A(x) S '{x)T a T^ it changes continu- 
ously at a transition where S is continuous. Thus the vanishing masses at the transition go to zero continuously. 

If a discontinuity arises at the n'th derivative of S(x), that is, if all the derivatives up to and including the n-l'th 
derivative are continuous, the condition on the PGB masses does not differ from proposition 3. This type of transitions 
do not impose a vanishing PGB mass but puts restrictions on higher rank tensors since in this case one has to expand 
up to n'th order in x. 

Proposition 5: If S(x) changes discontinuously at x = then at least one PGB mass must vanish at x = 0. 
If S(x) changes discontinuously at x — we may pick 



S{x) = S-{x) + [S+(x) - S~(x)] G(x) 



(73) 



where without loss of generality we can assume that S (x) and S + (x) are smooth at x — 0, and thus can be expanded 
in x. As a result of our assumptions at x = there is a degeneracy in the minimization problem [3l| 



Tr [A(0)S+(0)] = Tr[A(0)S- 
Tr [A(0)5+(0)T o ] - 
Tr[A(0)S-(0)T o ] = 

Let us remember that the extremization condition holds for all x, 



(74a) 
(74b) 
(74c) 



Tr[A{x) S(x)T a ] =0 
thus we are allowed to take the derivative of this function and this gives 



(75) 



Tr 



fs(x)T a 
ax 



Tr 



A(x)«T Q 
ax 



= 



(76) 



plugging in our ansatz in (|73[) and using the conditions in (|74[) one can show that 



Tr 
Tr 



^W+(0)T a 
ax 

^(0)5"(0)T a 
ax 



= 
= 



(77a) 
(77b) 



which by virtue of proposition 3 means that we must have at least one vanishing PGB mass. 

So a discontinuous transition in S(x) is accompanied by a vanishing PGB mass. This makes a lot of sense if we 
remember that for a small change in x to induce a large change in S(x) there has to be a direction in which the 
energy is not changing, which is another way of saying that a PGB mass vanishes. The situation is very similar to 
the minimization of the function 



f(X, Y) = -n 2 {X 2 + Y 2 ) + X(X 2 + Y 2 ) 2 + eX 



(78) 



with /i 2 > and A > 0. The true minimum lies along the X axis for e ^ 0, let us call it X a . Then, the sign of 
X Q is correlated to the sign of e. When one traverses e = continuously X a changes discontinuously but at e = 
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the minima is continuously degenerate along the angular direction, allowing the solution to move along it with zero 
energy cost. 

Corollary to Proposition 5: Let us recall that the PGB mass-squared matrix is given as 



Tr [A(x)S(x)T a T h ] (79) 

At a discontinuous transition this matrix will presumably change discontinuously, thus the PGB masses will also 
change discontinuously. As we have shown at least one PGB mass must vanish, however we can not know if this zero 
happens as a discontinuous drop in the mass or if it goes to zero continuously. All we know is that some masses may 
shift discontinuously. Nevertheless there is a condition on the shifts. Let us recall that the energy is alternatively 
given as 

E{x) = -^[M\x)] (80) 
and that it is a continuous function regardless how S changes. Thus at a discontinuous transition we have 



Tr [M 2 (0~)] = Tr [M 2 (0+)] (81) 

So the relative shifts to the left and right balance. Not all masses can shift up or down, some must go up and some 
must go down in such a way that the sum of the square of the masses remains the same. 

Therefor we arrive at the corollary, if there is a discontinuous shift in the lowest PGB mass in one direction (that is 
either from x negative to x positive or vice-versa) it must go to zero continuously in the other direction. This follows 
from the fact that at least one PGB mass must vanish at a discontinuous transition. 

Summary Proposition: 

At a transition point x c at least one PGB mass vanish. The necessary condition for x c to be a transition point is 



= and — (x c )^0 (82) 
ax 

This summarizes the previous results. From our line of reasoning it is not possible to prove that the above condition 
is also a sufficient condition. Nor the above condition discriminates the transition as continuous or discontinuous. It 
should be mentioned that we assumed A(x) to be smooth in this discussion and that it has a non-zero linear part in 
x around x c . If A(x) is allowed to have discontinuities and/or singularities these will manifest themselves in S(x) but 
these are to be considered as contrived transitions since they can be generated at ease. 

At a continuous transition the PGB mass eigenstate will follow suite, that is, the eigenstate that hits zero has 
the same quantum numbers at both sides of the transition point. However at a discontinuous transition this is not 
expected since at different sides of the critical point the vacuum is discontinuously different. If on the other hand 
two eigenstates hits zero mass the situation will be more complicated since one will have to allow for an arbitrary 
superposition of the null space at the critical point regardless of wheter the transition is continuous or discontinuous. 



Tr 



^(x c )S(x c )T a 



V. EXACT SOLUTION FOR SU(2) 



The case of SU(2) is much simplified since the adjoint formulation maps the problem in 0(3)«SU(2), this is the 
reason an exact treatment is possible. The extremization condition reads 



A T = A (83) 

That is the rotated hamiltonian A = AS is symmetric. The reason for this simplification is that in SU(2) we have 
(T a )bc = i£abc, the fully anti-antisymmetric tensor. We have seen that the extremization conditions constraints parts 
of the perturbation that breaks parity symmetry. Here we realize its full effect: for the case of SU(2) all rotated 
hamiltonians are parity conserving. Furthermore the PGB mass-squared matrix is given as 
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M 2 = Tr(A) J - A (84) 

where / represents 3x3 identity matrix. The above means that M 2 and A can be diagonalized simultaneously. 
Now let us do a singular value decomposition of A as follows 

A = L T A d R (85) 

where L and R are 0(3) matrices and A^ is a positive semi-definite diagonal matrix. With this the extremization 
condition reads 

A d = KA d K (86) 

with K = RSL T . We also have K T K = I and detK = sign(detA). We assume without loss of generality that the 
determinant signature is carried by L, in such a way that finally one has det(S') = 1. 
After the singular value decomposition of A, the PGB mass-squared matrix becomes 

M 2 = Tr(A d K)I - L T A d KL (87) 

We can define a new matrix M 2 = LM 2 L T 

M 2 = Tr(A d K)I - A d K (88) 

so that M 2 and M 2 are related via an orthogonal similarity transformation. Thus their eigenvalues are identical. 
Now let us remember the extremization condition 

A d = KA d K 

this unambiguously means that 

A d = K n A d K n (89) 

for an arbitrary integer n, so at face value it seems there are countably infinite extrema. However let us recall that 
A d is diagonal with positive entries. Without loss of generality we can assume A d = diag(a, b, c) with a > b > c > 
f32| . If det(A) = we can always choose det(L) = det(i?) = 1 which would mean that det(K) = 1. We will deal 
with degeneracies in A d separately. The extremization condition along with this observation simply means that K is 
diagonal and satisfies K 2 = I. Therefor all the extrema fall into the following classes 



diag(+l,+l.+l) 

ma)>o^*h^1;;1;:1S <*» 

diag(-l,-l,+l) 



diag(+l,+l,-l) 

*.(A)<o^* = <£g!!; 

diag(-l,-l,-l) 

The fact that we are using a different set of K's for different signatures of det(A) is only to have a clear exposition. 
The careful reader might have already noticed that the set of K's for det(A) < are just the set of K's for det(A) > 
multiplied with —I. In fact, for SU(2) case, a negative determinant A can be written as —I A' where the singular 
value structure of A' is the same as that of A. This allows one to use only one set of K's, namely the one we use for 
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det(A) 


K 


E 


PGB 


Description 


1 


-L.1 -1-1^ 


— a — b — c 


( -A-n _I_ h -\-n J_ r> J-h _I_ /-A 

{-\-a -f- o, -f-a -+- c, -)-o -T cj 
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(+i,+i,-i) 


—a — 6 + c 


(+a + +a — c, +6 — c) 


Minimum (Index Extremum) 




(+1,-1, +i) 


—a + fe — c 


(+a + c, +a — 6, —6 + c) 


Index 1 Extremum 




(-i,+i,+i) 


+a — 6 — c 


(+b + c, —a + c, —a + 6) 


Index 2 Extremum 




(-1,-1,-1) 


+a + b + c 


(—a — &, —a — c, —6 — c) 


Maximum (Index 3 Extremum) 



TABLE I: The extrema structure for A,j = diag(a, b, c) with a > b > c > 0. 



det(A) > 0, but in this approach the maximum in one case is the minimum for the other. In view of this fact which 
may lead to unnecessary confusion we did not follow that line of exposition. 

Armed with (|9T)j. and (|9ip we now list the energies and associated PGB mass-squared values for all the extrema 
in TABLE fl] In the table we have also indicated the description of the extrema in terms of the language of Morse 
theory, where the index of a non-degenerate (that is no vanishing PBG masses in our language) extremum is given by 
the number of negative eigenvalues of the Hessian (simply the PGB mass matrix in our terms). We would also like 
to remind the reader that, as a check, the number of extrema satisfies the Morse inequalities. That is, the number 
of extrema of index i, is larger or equal to the z'th Betti number of the manifold. For SU(2)~S 3 the Bctti numbers 
are 1,0,0,1 and the number of extrema of our function are 1,1,1,1 which satisfies the inequality as they should. 
How can we make use of Morse inequalities? For instance for SU(3) the Betti numbers are 1, 0, 0, 1, 0, 1, 0, 0, 1. These 
mean, apart from stating that a minimum and a maximum must occur (which is intuitive anyways since SU(3) is 
compact), that there must be at least one extrema with index 3 and 5. That is, extrema with 3 and 5 negative PGB 
masses respectively. Other than this, Morse inequalities, unfortunately are not too useful for an explicit solution. For 
completeness we remind the reader that the Betti numbers of SU(N) are given by the coefficients of the respective 
power in the Poincare polynomial 

Psu(N)(t) = (1 + i 3 )(l + t 5 ) ■ ■ ■ (1 + t™- 1 ) (92) 
Degeneracies in A^: Let us remember that the extremization condition is 



A d = KA d K 

If there are degeneracies in A^ the behavior of the energy and the PGB masses can still be read from TABLE fl] 
However the solution set for K is larger. Let us start with an example. Assume that A^ = diag(a,a, c), then the 
possible solutions to the extremum condition are 

diag(+l,+l,+l) 

det(A) > => K = { k+(6) (93) 

diag(-l,-l,+l) 

diag(+l,+l,-l) 

det(A) < =^> K = { K-{6) (94) 

diag(- 1,-1,-1) 

where we have 

cos 9 sin 9 

k±(0) = ( sin(9 -cos(9 j (95) 
±1 
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det(A) 


K 


E 


PGB 


Description 


1 

~r 


( -1-1 - _I_ T - _I_ T ^ 
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— Zfi — C 
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+ 


(-1;-1;+1) 


+2a - c 


(—a + c, —a + c, —2a) 


Maximum (Index 3 Extremum) 




(+1;+1;-1) 


-2a + c 


(+2a, +a — c, +a — c) 


Minimum (Index Extremum) 




K.{6) 


— c 


(+a + c, , — a + c) 


Degenerate Critical Point 




(-1;-1;-1) 


+2a + c 


( — a — c, —a — c, —2a) 


Maximum (Index 3 Extremum) 



TABLE II: The extrema structure for Ad = diag(a, a, c) with a > c > 0. 



det(A) 


K 


E 


PGB 


Description 


+ 


(+1.+1.+1) 


-a - 26 


(+o + b, +a + 6, +26) 


Minimum (Index Extremum) 


+ 


(+1,-1,-1) 


-a + 2fe 


(+a - b, +a - 6, -26) 


Index 1 Extremum 


+ 


k+{9) 


+a 


( ,-a + 6, -a - 6) 


Degenerate Maxima 




K-{0) 


—a 


(+a + b, +a - b, ) 


Degenerate Minima 




(-1,+1,+1) 


+a - 2b 


(+26, -a + 6, -o + 6) 


Index 1 Extremum 




(-1,-1,-1) 


+a + 2b 


(-26, -a -6, -a -6) 


Maximum (Index 3 Extremum) 



TABLE III: The extrema structure for A^ = diag(a, 6, 6) with a > 6 > 0. 



These matrices obey k± = I and are called reflection matrices. The results for the degenerate cases are summarized 
in TABLES HT1 IIIII and HVl The matrices k±(8) and K±(6,<fi) referred in those tables are given as 

sin 9 \ 

±1 (96) 
-cos 9 J 



cos 2 8 sin 2(f> sin 28 sin tf> \ 
1 + 2 cos 2 8 cos 2 (f> sin 28 cos (f> (97) 
sin 28 cos (f> -l + 2sin 2 6>/ 

A. Summary of Results for SU(2) 

It is clear from our treatment of the case of SU(2) that a degenerate minima occurs if the two lowest singular 
values of A are degenerate and if determinant of A is negative. This condition is both necessary and sufficient. 
Now, what will happen, as we have discussed previously, if one has a A(x) and changes x to have an excursion in the 
solution space? The condition for a degenerate minima may be approached either by having the two lowest singular 
values to be the same and have det(A) crosses zero from positive to negative values, or having det(A) < and force 
the two lowest singular values approach each other, or both of these mechanisms. In all of these ways if A(a;) has a 
non-vanishing linear part in x around x c there will be a crossing from K = diag(+l, +1, —1) to K = diag(+l, —1, +1). 
Thus if A(x) has a non-vanishing linear part in x around x c the transition is a discontinuous transition 
in S. 

For SU(2) the rotated hamiltonian A^ fe ' is always real. The reason for this is that after hermiticity and T-invariance 
is imposed, any hamiltonian in the adjoint map has the following form 



det(A) 


K 


E 


PGB 


Description 


+ 


(+1,+1,+1) 


—3a 


(+2a,+2a, +2a) 


Minimum (Index Extremum) 


+ 


K+{e,ct>) 


+a 


(0,0, -2a) 


Degenerate Maxima 






—a 


( , , 2a) 


Degenerate Minima 




(-1,-1,-1) 


+3a 


(-2a, -2a, -2a) 


Maximum (Index 3 Extremum) 



TABLE IV: The extrema structure for Ad = diag(a, a, a) with a > 0. 



K±(0) 




and 



K ± {8) = ± 



-1 + 2 cos 2 (9 sin 2 . 
cos 2 8 sin 2(f 
sin 28 sin 6 
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/ A B \ 
A= C D 
\0 E/ 

Where, in our ordering of the generators, the upper-left block represents the real-symmetric generators. This matrix 
can be diagonalized within the space of the real generators, that is L and R matrices diagonalizing A do not mix 
real-symmetric and pure-imaginary anti-symmetric spaces of generators. Consequently the resulting S will not mix 
these either. As a remark we also state that under these conditions, L and R will carry a single rotation angle each. 
The reason for this is that there is only one pure imaginary generator, a circumstance related to the fact that all 
representations of SU(2) are pseudo-real, that is a complex conjugation operator exists within the algebra [33|. Of 
course if one relaxes the T-invariance condition and consequently allow A to be a generic real 3x3 matrix one will 
have complex rotated hamiltonians A^- . 

In the previous paragraph we do not mean that the corresponding W for a given S will always be real. One can 
show using the K's we have found and the structure of L and R following from the discussion above that W is one 
of the following 

/ cos (9 sin6*\ / icoscj) i sin cj> \ / — i sinep i cos cj> \ /— sin 9 cos 9 
\ — sin# cos 9 J ' lisin^ — icoscj) J ' I icoscj) isincp J ' I — cos 9 — sin0 

here 9 = (s + 1)/2 and <j) = (s — t)/2 where s and t represent the rotation angles in L and R respectively. 

As can be seen from the complexity of W comes as an overall factor and consequently the rotated hamiltonian 
A 1 ^ 1 will be real. So the solutions are always in the CPC phase. We would like to remind the reader that this happens 
because the energy function we are minimizing does not care about the overall phase of W. Of course if there are 
other perturbations such as a mass term for the spinor these conclusions will change since the solution space will be 
altered. 



B. Frequency of Light PGB Masses 

Recently there has been considerable attention on models where the Higgs particle is a naturally light goldstone 
boson [l6|, [I?], [H, E^l • This possibility is also advocated in within the context of the problem we are dealing with 
Q. So the question is: assuming we are sampling models from a random set what is the probability of having a light 
PGB mass? 

Our treatment of the problem for SU(2) is exact and in fact it is applicable not only for originally T-invariant 
hamiltonians but also for generic cases where A is a 3x3 real matrix. As a consequence of this exact treatment the 
possibility for a vanishing PGB mass can be analyzed analytically. Even though this can be carried out it would be 
very cumbersome, so we resort to Monte-Carlo methods. 

We present our findings for the following random samples. Models of type I are full hamiltonians where T-invariance 
is not imposed, that is they are 3x3 matrices. Models of type II are full hamiltonians where T-invariance is imposed, 
that is they are block diagonal as discussed before. Models of type III are models where A is diagonal and hence 
automatically T-invariant, thus one can see this type as constrained type II where off-diagonal entries are taken to 
be zero from the outset, thereby introducing the element of hierarchy in A. For each of these types we present three 
flavors. Flavor A where the elements are sampled from [0,1], flavor B where elements are sampled from [—1,0] and 
flavor C where elements are taken from [—1,1]. 

For all these types and flavors we have sampled 3.2 10 6 models and solved the minimization problem numerically. 
In numerical minimization algorithms one can not be sure that the computed minimum is a g lobal one. However, for 
SU(2), our exact treatment shows that modulo Zn = 2 degeneracies the minimum is unique [34] and thus the minimum 
found numerically is the global minimum. The normalized probability distributions of the PGB masses are in Fig. [T] 

Models IA and IB have the same PGB distributions, this is clear since sampling space of flavor B is just the negative 
of sampling space of flavor A. Since minimum and maximum structure of the solutions in SU(2) are symmetric this is 
expected. This behavior is also present in models IIA and IIB. However this symmetry does not exist between models 
IIIA and IIIB since in IIIA det(A) > by construction and a PGB mass never vanishes. 

What is remarkable is that for models of type II and III there is considerable probability for the lightest PGB 
mass to be near zero. This behavior is more emphasized for type III (except for IIIA where it is impossible to have a 
vanishing PGB mass from the outset). This is to be contrasted to the behavior of models of type I where T-invariance 
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FIG. 1: Normalized PGB mass-squared distributions for models discussed in the text. The columns and rows represent model 
type and flavor respectively. For example the rightmost graph in the middle row represent results model MODEL IIC. It is self 
evident which plot represents which PGB mass in a given graph but for clarity we picked red, black and green to represent the 
lightest, middle and heaviest PGB mass. The fluctuations in the graphs are representative of the finite number of samples we 
have taken for each model. 

is not imposed. It is understandable that when one has less number of degrees of freedom it becomes more probable 
for those parameters to contrive so that the condition for a light PGB mass is satisfied. 

We know from Morse lemma that functions with non-degenerate critical points are dense within the space of 
functions. That is, in our language, we should expect a generic model to have no zero PGB masses. In fact we 
observe this lemma; the probability for an exactly vanishing PGB mass is zero. Nevertheless the probability for a 
low PGB mass is considerable. What is meant by a truly generic function in Morse theory translates to our study 
as a model where T-invariance is not a priori imposed. So the key element in the occurrence of light PGB masses 
is the assumption of T-invariance since this constrains the models. Constraining models further such as going from 
models of type II to type III makes the light PGB masses more likely. This constraint (vanishing of some elements) 
introduces a hierarchy in the explicit symmetry breaking hamiltonian. 

The horizontal axis figures [T] and [5] represent the PGB mass squared value and the vertical axis is dimensionless 
since it represents the probability distribution. In terms of dimensionfull parameters the x axis is normalized in terms 
of Att/(M 2 F 2 )- The scale M of the explicit perturbation must not interfere with the scale AttF of the spontaneous 
symmetry breaking otherwise it can not be called a perturbation. Thus it would be desirable that the largest PGB 
mass is at most of order F, the decay constant of the Goldstone bosons. For instance for the case of SU(3) which we 
will present below we find that about % 14 of the models have a PGB mass below 0.1F for models in the set IIIB. 

VI. DIGRESSION ON SU(3) 

Unfortunately an exact treatment of SU(3) case is not readily available. The adjoint map transforms the problem 
into 0(8) matrices and a singular value decomposition of A is not available as was in the case of SU(2). Even for 
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FIG. 2: Normalized PGB mass-squared distributions for the SU(3) models discussed in the text. The models types and flavors 
are the same as in subsection IV Bl The columns and rows represent model type and flavor respectively. For example the 
rightmost graph in the middle row represent results model MODEL IIC. 

an originally diagonal A the analysis is difficult and we reserve it for future work. For certain simple cases an exact 
treatment could be possible but we do not pursue this here either. In SU(3) the interesting generalization is that the 
rotated hamiltonian can have rational phases and thus PCP solutions are possible. In this section we present our 
findings after the same Monte-Carlo analysis we did for SU(2). Since we do not have an exact solution for SU(3) we 
can not say that the minimum is always unique and consequently the honest statement is that the numerical study is 
dealing with local minima. 

The PGB mass distributions are presented in Fig [5] We also present the percentage occurrences of various phases 
in Table El 

The first thing to notice is that models IA and IB (and IIA and IIB) do not have the same PGB distributions, 
unlike the SU(2) case. This is hint about an asymmetry in the minimum and maximum structures of a given model 
with respect to A — > —A. As it was the case in SU(2), we see that the probability of occurrence of a light PGB mass 
is considerable in models of type II and III compared to type I where T-invariance is not imposed a priori. These 
probabilities are larger than the corresponding ones in SU(2). This is understandable because in SU(3) imposing 
T-invariance constrains more elements out of the total possibilities compared to SU(2). In type III models this effect 
is much more emphasized, except for IIIA, where by construction the minimization is always achieved by S = I. 
As a minor note we would like to point out that the PGB mass distributions for models IA show an almost-good 
SU(2) symmetry since the PGB masses seem to form clusters with cardinality 1 + 3 + 2 + 2 coinciding with the SU(2) 
decomposition of the adjoint of SU(3), that is 8 = 1 + 3 + 2 + 2*. For the other cases such an observation is not 
possible and SU(2) is badly broken. Thus the behavior of PGB mass distributions is similar to what we have obtained 
in SU(2) except that in SU(3) an almost vanishing PGB mass is more probable. 

We see from Table IVl that for all models of type I the solutions are in the CPV phase. This makes sense because 
in these models T-invariance is not originally imposed. What is illuminating is that for models of type II there are 
no PCP phases. In reality, we have observed a tiny fraction of models resulting in the PCP phase. But this was less 
than the statistical error rendering the result compatible with 0. This observation implies that the PCP phases are 
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MODEL 


CPC 


PCP 


CPV 


IA 


0.0 


0.0 


100.0 


IB 


0.0 


0.0 


100.0 


IC 


0.0 


0.0 


100.0 


IIA 


86.3 


0.0 


13.7 


IIB 


84.0 


0.0 


16.0 


IIC 


77.4 


0.0 


22.6 


IIIA 


100.0 


0.0 


0.0 


IIIB 


41.3 


44.7 


14.0 


IIIC 


62.0 


31.4 


6.6 



TABLE V: The occurrence percentages of different phases for the SU(3) models studied. Model types and flavors are the same 
as in subsection IV Bl All numbers have a statistical error of 0.1 percent. 

not living in the bulk of the space of most general models with original T-invariance. This is expected since PCP 
phases are represented by O(N) type orbits that is not connected to the identity element and thus occur with more 
contrivedness than the CPC cases which are proper O(N) subgroups. For type IIIA since the minimization is always 
achieved by S = / all models yield a CPC solution. The PCP phases show themselves in models IIIB and IIIC. We 
therefor conclude that in SU(3) CPC phases are more common and PCP phases occur with respectable frequency 
only in models where there are more constraints than just T-invariance. The benchmark model studied in 0] is 
such an example. 

Therefor as a general conclusion we simply state that within the context of vacuum alignment and the possible 
subsequent spontaneous breaking of the CP symmetry there is considerable possibility for a light PGB mass if in the 
original hamiltonian T-invariance is imposed. As a speculation we would like to argue that it seems as if a similar 
mechanism to that of the Peccei-Quinn model operates, of course only in a statistical sense [35j . 

VII. CONCLUSIONS AND FUTURE DIRECTIONS 

We studied in detail the vacuum alignment problem in models where a global chiral symmetry first spontaneously 
breaks to its vectorial subgroup and later is explicitly broken by a perturbation of four-Fermi type. We exposed 
different CP behavior classes and transitions between those classes, presented an explicit solution for two flavor case, 
SU(2). We have also shown with statistical evidence that there is an increased chance for a light pseudo-Goldstone 
boson if the perturbation has an ab initio CP invariance. 

The most obvious thing to pursue next is to be more energetic for finding an analytical treatment for the three 
flavor case SU(3), since this is the case with the lowest number of flavors where all of the possible CP violation classes 
are present. A complete list of extrema like that of SU(2) is what one desires, however there is no clear path to follow 
for this. A recipe to discriminate the CP structure of the minimum is actually enough to understand the behavior of 
the model. As we have mentioned there are simplified models where an exact treatment is possible and a diagonal 
A a b seems to be the best starting point. 

Another avenue which is of interest is to allow mass terms. In a model generalized in this manner the energy 
function to minimize becomes 

E(W) = -aTr [MW + W^M^] - /3A ah Tr [W^\ a W\ b ] . (99) 

In this work we have studied a — case. The other extreme, (3 = 0, is a well known system with a research literature 
too wide to enumerate here. See however recent interesting results in (9l. ITo|. It will be interesting how an interplay 
between these two terms can change things. Works along these lines are in progress. 
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As an explicit example one can take Ho as describing a scaled-up QCD type model. 

Of course the set of compatible vacua may be described by a single state or a set of states degenerate continuously or 
discretely. 

This is necessary because the perturbations which will explicitly break the chiral symmetry must really be perturbations 
on top of the spontaneous breaking. If the explicit breaking terms are more important than the mechanism triggering the 
spontaneous breaking there is no sense in talking about a symmetry at all 

The Dashen procedure, however, will change all these interactions accordingly when one rotates the full perturbing hamil- 
tonian to become compatible with the standard vacuum 

In this work we use the word quadratic to emphasize that the energy function is proportional to WuW^ k . A general 
quadratic form will depend on det(VK). 

Of course in the presence of gravity a constant addition to the vacuum energy becomes important and can not be neglected 
This seems to spoil the invertibility of the decomposition. But terms with Ao represents elements of A ,:,fc ' which are 
irrelevant from the outset. 

Needless to say one can carry all the formulas in the original formulation to the adjoint map by brute force computation, 
using the decomposition in (|39[) . 

Since we are using the active point of view, that is we rotate the perturbation not the vacuum, the definition of parity 
symmetry with respect to the standard vacuum is not altered. So the extremization condition is really trying to minimize 
parity breaking. 

Of course in this sea- island analogy one must remember that the dimensionalities of the islands and sea are different. 
If there is one vanishing PGB mass one does not necessarily have = , this would contradict the original assumption. 
In this case all is needed is that s|j~ and must be proportional to each other. 

This degeneracy is to be distinct from the E(W) = E(W) degeneracy. We also remind the reader that Zn degeneracies 
are absent in the adjoint map of the problem. 

This particular ordering of the singular values is not a must and our treatment of the problem can be carried out for any 

other ordering. We choose this form for simplicity in laying out the results 

In the usual ordering of the generators this is given as ioi. 

Except of course when it is accompanied with a vanishing PGB mass 

To arrive at this statistical conclusion we made use of random models sampled from regions [0, 1], [—1,0] and [—1, 1]. One 
can object to our treatment in view of the fact that since the origin is part of the sampling space some elements are probably 



24 



much smaller than the others. This is a legitimate criticism even though the number of hierarchically separated elements 
is low with high probability. Nevertheless we have also studied the mentioned models with sampling regions of the form 
[1, 2], [—2, —1] and [—2, — 1] U [1, 2] where no hierarchy whatsoever is present. The structure of the PGB mass distributions 
are the same albeit the distributions are lower at zero mass, meaning that the effect is present but less emphasized. So 
hierarchy within the elements of A ab is important. But we have already seen this concept in our discussion; there models 
of type III which were models of type II with only diagonal entries yielded larger probabilities for small PGB masses. 



